(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property 
Organization 
Internationa] Bureau 

(43) International Publication Date 
22 January 2004 (22.01.2004) 




(10) International Publication Number 

PCT WO 2004/008761 Al 



(51) Internationa] Patent Classification 7 : 



H04N 7/12 



(21) International Application Number: 

PCT/US2003/021728 

(22) International Filing Date: 14 July 2003 (14.07.2003) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

60/395,843 
60/395,874 
10/410,456 



15 July 2002 (15.07.2002) US 
15 July 2002 (15.07.2002) US 
9 April 2003 (09.04.2003) US 



(71) Applicant (for all designated States except US): THOM- 
SON LICENSING S.A. [FR/FR]; 46, Quai A. Le Gallo, 
F-92648 Boulogne (FR). 

(72) Inventor; and 

(75) Inventor/Applicant (for US only): BOYCE, Jill, Mac- 
Donald [US/US]; 3 Brandywine Court, Manalapan, NJ 
07726 (US). 



(74) Agents: TRIPOLI, Joseph, S. et al.; Thomson Licensing 
Inc., Two Independence Way, Suite #200, Princeton, NJ 
08540 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NI, NO, NZ, OM, PG, PH, PL, PT, RO, RU, SC, 
SD, SE, SG, SK, SL, SY, TJ, TM, TN, TR, TT, TZ, UA, 
UG, US, UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, EE, IT, LU, MC, NL, PT, RO, 
SE, SI, SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG): 

Published: 

— with international search report 

For two-letter codes and otlter abbre\>iations. refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



=5 (54) Title: ADAPTIVE WEIGHTING OF REFERENCE PICTURES IN VIDEO ENCODING 



i 



310 



r 



320 



-300 



VLD 




INVERSE 




INVERSE 








QUANTIZE 


— > 


TRANSFORM 


\*J 



REF PIC INDEX 



380 
J_ 



REF PICTURE 
WEIGHTING 
FACTOR LOOKUP 



390 



•370 




00 



o 



REFERENCE 
PICTURE 
STORES 



o 
o 



LU 



0 



>- 




cc 




I — 








ZD 




O 




O 






CO 


cc 


LU 


I— 


CC 


^zl 


CC 


ZD 


o 


O 




O 



(57) Abstract: A video decoder ( Figure 3, 300), encoder (500), and corresponding methods for processing video signal data for 
an image block and a particular reference picture index to predict the image block are disclosed that utilize adaptive weighting of 
reference pictures to enhance video compression, where a decoder (300) includes a reference picture weighting factor unit (380) 
for determining a weighting factor corresponding to the particular reference picture index; an encoder (500) includes a reference 
picture weighting factor assignor (572) for assigning a weighting factor corresponding to the particular reference. picture index; and 
a method for decoding includes receiving a reference picture index with the data that corresponds to the image block, determining 
a weichling factor for each received reference picture index, retrieving a reference picture for each index, motion compensating the 
retrieved reference picture, and multiplying the motion compensated reference picture by the corresponding weighting factor to form 
a weighted motion compensated reference picture. 
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ADAPTIVE WEIGHTING OF REFERENCE PICTURES IN VIDEO ENCODING 



CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Patent Application Serial 

5 No. 60/395,843 (Atty. Docket No. PU020340), entitled "Adaptive Weighting Of 

Reference Pictures In Video CODEC and filed July 15, 2002, which is incorporated 
by reference herein in its entirety. In addition, this application claims the benefit of 
U.S. Provisional Patent Application Serial No. 60/395,874 (Atty. Docket No. 
PU020339), entitled "Motion Estimation With Weighting Prediction" also filed July 15, 

10 2002, which is incorporated by reference herein in its entirety. 

FIELD OF THE INVENTION 

The present invention is directed towards video encoders and in particular, 
towards adaptive weighting of reference pictures in video encoders. 

15 

BACKGROUND OF THE INVENTION 

Video data is generally processed and transferred in the form of bit streams. 
Typical video compression coders and decoders ("CODECs") gain much of their 
compression efficiency by forming a reference picture prediction of a picture to be 
20 encoded, and encoding the difference between the current picture and the prediction. 
The more closely that the prediction is correlated with the current picture, the fewer 
bits that are needed to compress that picture, thereby increasing the efficiency of the 
process. Thus, it is desirable for the best possible reference picture prediction to be 
formed. 

25 In many video compression standards, including Moving Picture Experts 

Group ("MPEG")-1 , MPEG-2 and MPEG-4, a motion compensated version of a 
previous reference picture is used as a prediction for the current picture, and only the 
difference between the current picture and the prediction is coded. When a single 
picture prediction ("P" picture) is used, the reference picture is not scaled when the 

30 motion compensated prediction is formed. When bi-directional picture predictions 
("B" pictures) are used, intermediate predictions are formed from two different 
pictures, and then the two intermediate predictions are averaged together, using 
equal weighting factors of (Vz, Vz) for each, to form a single averaged prediction. In 



BNSDOCID: <W O 2004008761 A 1 I > 



WO 2004/008761 PCT/US2003/021728 

2 

these MPEG standards, the two reference pictures are always one each from the 
forward direction and the backward direction for B pictures. 

St 1MMARY OF THE INVENTION 
5 These and other drawbacks and disadvantages of the prior art are addressed 

by a system and method for adaptive weighting of reference pictures in video coders 
and decoders. 

A video encoder, and corresponding methods for processing video signal data 
for an image block and a particular reference picture index to predict the image block 

10 are disclosed that utilize adaptive weighting of reference pictures to enhance video 
compression. An encoder includes a reference picture weighting factor assignor for 
assigning a weighting factor to the particular reference picture index. 

A corresponding method for encoding video video signal data for an image 
block includes receiving a substantially uncompressed image block and assigning a 

15 weighting factor for the image block corresponding to a particular reference picture 
having a corresponding index. Motion vectors are computed corresponding to the 
difference between the image block and the particular reference picture. A particular 
reference picture is motion compensated in correspondence with the motion vectors 
and the motion compensated reference picture is modified by the assigned weighting 

20 factor to form a weighted motion compensated reference picture. The substantially 
uncompressed image block is compared to the weighted motion compensated 
reference picture, and a signal indicative of the difference between the substantially 
uncompressed image block and the weighted motion compensated reference picture 
along with the corresponding index of the particular reference picture is encoded. 

25 

RR1EF DESCRIPTION OF THE DRAWINGS 

Adaptive weighting of reference pictures in video coders and decoders in 
accordance with the principles of the present invention are shown in the following 
exemplary figures, in which: 
30 Figure 1 shows a block diagram for a standard video decoder; 

Figure 2 shows a block diagram for a video decoder with adaptive bi- 
prediction; 
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Figure 3 shows a block diagram for a video decoder with reference picture 
weighting in accordance with the principles of the present invention; 

Figure 4 shows a block diagram for a standard video encoder; 

Figure 5 shows a block diagram for a video encoder with reference picture 
5 weighting in accordance with the principles of the present invention; 

Figure 6 shows a flowchart for a decoding process in accordance with the 
principles of the present invention; and 

Figure 7 shows a flowchart for an encoding process in accordance with the 
principles of the present invention. 

10 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

The present invention presents an apparatus and method for motion vector 
estimation and adaptive reference picture weighting factor assignment. In some 
video sequences, in particular those with fading, the current picture or image block to 

15 be coded is more strongly correlated to a reference picture scaled by a weighting 
factor than to the reference picture itself. Video CODECs without weighting factors 
applied to reference pictures encode fading sequences very inefficiently. When 
weighting factors are used in encoding, a video encoder needs to determine both 
weighting factors and motion vectors, but the best choice for each of these depends 

20 on the other, with motion estimation typically being the most computationally 
intensive part of a digital video compression encoder. 

In the proposed Joint Video Team ("JVT") video compression standard, each P 
picture can use multiple reference pictures to form a picture's prediction, but each 
individual motion block or 8x8 region of a macroblock uses only a single reference 

25 picture for prediction. In addition to coding and transmitting the motion vectors, a 
reference picture index is transmitted for each motion block or 8x8 region, indicating 
which reference picture is used. A limited set of possible reference pictures is stored 
at both the encoder and decoder, and the number of allowable reference pictures is 
transmitted. 

30 In the JVT standard, for bi-predictive pictures (also called "B" pictures), two 

predictors are formed for each motion block or 8x8 region, each of which can be from 
a separate reference picture, and the two predictors are averaged together to form a 
single averaged predictor. For bi-predictively coded motion blocks, the reference 
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pictures can both be from the forward direction, both be from the backward direction, 
or one each from the forward and backward directions. Two lists are maintained of 
the available reference pictures that may used for prediction. The two reference 
pictures are referred to as the list 0 and list 1 predictors. An index for each reference 
picture is coded and transmitted, ref_idx_IO and ref_idx_M , for the list 0 and list 1 
reference pictures, respectively. Joint Video Team ("JVT") bi-predictive or "B" 
pictures allows adaptive weighting between the two predictions, i.e., 

Pred = [(P0)(Pred0)] + [(P1)(Pred1)] + D, 
where P0 and P1 are weighting factors, PredO and Predl are the reference picture 
predictions for list 0 and list 1 respectively, and D is an offset. 

Two methods have been proposed for indication of weighting factors. In the 
first, the weighting factors are determined by the directions that are used for the 
reference pictures. In this method, if the ref_idx_IO index is less than or equal to 
ref_idx_H , weighting factors of ( 1 / 2 , Vz ) are used, otherwise (2, -1) factors are used. 

In the second method offered, any number of weighting factors is transmitted 
for each slice. Then a weighting factor index is transmitted for each motion block or 
8x8 region of a macroblock that uses bi-directional prediction. The decoder uses the 
received weighting factor index to choose the appropriate weighting factor, from the 
transmitted set, to use when decoding the motion block or 8x8 region. For example, 
if three weighting factors were sent at the slice layer, they would correspond to weight 
factor indices 0, 1 and 2, respectively. 

The following description merely illustrates the principles of the invention. It • 
will thus be appreciated that those skilled in the art will be able to devise various 
arrangements that, although not explicitly described or shown herein, embody the 
principles of the invention and are included within its spirit and scope. Furthermore, 
all examples and conditional language recited herein are principally intended 
expressly to be only for pedagogical purposes to aid the reader in understanding the 
principles of the invention and the concepts contributed by the inventor to furthering 
the art, and are to be construed as being without limitation to such specifically recited 
examples and conditions. Moreover, all statements herein reciting principles, 
aspects, and embodiments of the invention, as well as specific examples thereof, are 
intended to encompass both structural and functional equivalents thereof. 
Additionally, it is intended that such equivalents include both currently known 
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equivalents as well as equivalents developed in the future, i.e., any elements 
developed that perform the same function, regardless of structure. 

Thus, for example, it will be appreciated by those skilled in the art that the 
block diagrams herein represent conceptual views of illustrative circuitry embodying 
5 the principles of the invention. Similarly, it will be appreciated that any flow charts, 
flow diagrams, state transition diagrams, pseudocode, and the like represent various 
processes which may be substantially represented in computer readable media and 
so executed by a computer or processor, whether or not such computer or processor 
is explicitly shown. 

10 - The functions of the various elements shown in the figures may be provided 
through the use of dedicated hardware as well as hardware capable of executing 
software in association with appropriate software. When provided by a processor, 
the functions may be provided by a single dedicated processor, by a single shared 
processor, or by a plurality of individual processors, some of which may be shared. 

15 Moreover, explicit use of the term "processor" or "controller" should not be construed 
to refer exclusively to hardware capable of executing software, and may implicitly 
include, without limitation, digital signal processor ("DSP") hardware, read-only 
memory ("ROM") for storing software, random access memory ("RAM"), and 
non-volatile storage. Other hardware, conventional and/or custom, may also be 

20 included. Similarly, any switches shown in the figures are conceptual only. Their 

function may be carried out through the operation of program logic, through dedicated 
logic, through the interaction of program control and dedicated logic, or even 
manually, the particular technique being selectable by the implementer as more 
specifically understood from the context. 

25 In the claims hereof any element expressed as a means for performing a 

specified function is intended to encompass any way of performing that function 
including, for example, a) a combination of circuit elements that performs that 
function or b) software in any form, including, therefore, firmware, microcode or the 
like, combined with appropriate circuitry for executing that software to perform the 

30 function. The invention as defined by such claims resides in the fact that the 
functionalities provided by the various recited means are combined and brought 
together in the manner which the claims call for. Applicant thus regards any means 
that can provide those functionalities as equivalent to those shown herein. 



BNSDOCID: <WO__2004008761A1_I_> 



WO 2004/008761 PCT/US2003/021728 

6 

As shown in Figure 1 , a standard video decoder is indicated generally by the 
reference numeral 100. The video decoder 100 includes a variable length decoder 
("VLD") 110 connected in signal communication with an inverse quantizer 120. The 
inverse quantizer 120 is connected in signal communication with an inverse 
transformer 130. The inverse transformer 130 is connected in signal communication 
with a first input terminal of an adder or summing junction 140, where the output of 
the summing junction 140 provides the output of the video decoder 100. The output 
of the summing junction 140 is connected in signal communication with a reference 
picture store 150. The reference picture store 150 is connected in signal 
communication with a motion compensator 160, which is connected in signal 
communication with a second input terminal of the summing junction 140. 

Turning to Figure 2, a video decoder with adaptive bi-prediction is indicated 
generally by the reference numeral 200. The video decoder 200 includes a VLD 210 
connected in signal communication with an inverse quantizer 220. The inverse 
quantizer 220 is connected in signal communication with an inverse transformer 230. 
The inverse transformer 230 is connected in signal communication with a first input 
terminal of a summing junction 240, where the output of the summing junction 240 
provides the output of the video decoder 200. The output of the summing junction 
240 is connected in signal communication with a reference picture store 250. The 
reference picture store 250 is connected in signal communication with a motion 
compensator 260, which is connected in signal communication with a first input of a 
multiplier 270. 

The VLD 210 is further connected in signal communication with a reference 
picture weighting factor lookup 280 for providing an adaptive bi-prediction ("ABP") 
coefficient index to the lookup 280. A first output of the lookup 280 is for providing a 
weighting factor, and is connected in signal communication to a second input of the 
multiplier 270. The output of the multiplier 270 is connected in signal communication 
to a first input of a summing junction 290. A second output of the lookup 280 is for 
providing an offset, and is connected in signal communication to a second input of 
the summing junction 290. The output of the summing junction 290 is connected in 
signal communication with a second input terminal of the summing junction 240. 

Turning now to Figure 3, a video decoder with reference picture weighting is 
indicated generally by the reference numeral 300. The video decoder 300 includes a 
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VLD 310 connected in signal communication with an inverse quantizer 320. The 
inverse quantizer 320 is connected in signal communication with an inverse 
transformer 330. The inverse transformer 330 is connected in signal communication 
with a first input terminal of a summing junction 340, where the output of the summing 

5 junction 340 provides the output of the video decoder 300. The output of the 

summing junction 340 is connected in signal communication with a reference picture 
store 350. The reference picture store 350 is connected in signal communication with 
a motion compensator 360, which is connected in signal communication with a first 
input of a multiplier 370. 

10 — The-VLD-310-is further connected in signal communication with a reference 
picture weighting factor lookup 380 for providing a reference picture index to the 
lookup 380. A first output of the lookup 380 is for providing a weighting factor, and is 
connected in signal communication to a second input of the multiplier 370. The 
output of the multiplier 370 is connected in signal communication to a first input of a 

15 summing junction 390. A second output of the lookup 380 is for providing an offset, 
and is connected in signal communication to a second input of the summing junction 
390. The output of the summing junction 390 is connected in signal communication 
with a second input terminal of the summing junction 340. 

As shown in Figure 4, a standard video encoder is indicated generally by the 

20 reference numeral 400. An input to the encoder 400 is connected in signal 

communication with a non-inverting input of a summing junction 410. The output of 
the summing junction 410 is connected in signal communication with a block 
transformer 420. The transformer 420 is connected in signal communication with a 
quantizer 430. The output of the quantizer 430 is connected in signal communication 

25 with a variable length coder ("VLC") 440, where the output of the VLC 440 is an 
externally available output of the encoder 400. 

The output of the quantizer 430 is further connected in signal communication 
with an inverse quantizer 450. The inverse quantizer 450 is connected in signal 
communication with an inverse block transformer 460, which, in turn, is connected in 

30 signal communication with a reference picture store 470. A first output of the 

reference picture store 470 is connected in signal communication with a first input of 
a motion estimator 480. The input to the encoder 400 is further connected in signal 
communication with a second input of the motion estimator 480. The output of the 
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motion estimator 480 is connected in signal communication with a first input of a 
motion compensator 490. A second output of the reference picture store 470 is 
connected in signal communication with a second input of the motion compensator 
490. The output of the motion compensator 490 is connected in signal 

5 communication with an inverting input of the summing junction 410. 

Turning to Figure 5, a video encoder with reference picture weighting is 
indicated generally by the reference numeral 500. An input to the encoder 500 is 
connected in signal communication with a non-inverting input of a summing junction 
510. The output of the summing junction 510 is connected in signal communication 

10 with a block transformer 520. The transformer 520 is connected in signal 

communication with a quantizer 530. The output of the quantizer 530 is connected in 
signal communication with a VLC 540, where the output of the VLC 440 is an 
externally available output of the encoder 500. 

The output of the quantizer 530 is further connected in signal communication 

15 with an inverse quantizer 550. The inverse quantizer 550 is connected in signal 

communication with an inverse block transformer 560, which, in turn, is connected in 
signal communication with a reference picture store 570. A first output of the 
reference picture store 570 is connected in signal communication with .a first input of 
a reference picture weighting factor assignor 572. The input to the encoder 500 is 

20 further connected in signal communication with a second input of the reference 

picture weighting factor assignor 572. The output of the reference picture weighting 
factor assignor 572, which is indicative of a weighting factor, is connected in signal 
communication with a first input of a motion estimator 580. A second output of the 
reference picture store 570 is connected in signal communication with a second input 

25 of the motion estimator 580. 

The input to the encoder 500 is further connected in signal communication with 
a third input of the motion estimator 580. The output of the motion estimator 580, 
which is indicative of motion vectors, is connected in signal communication with a f irst 
input of a motion compensator 590. A third output of the reference picture store 570 

30 is connected in signal communication with a second input of the motion compensator 
590. The output of the motion compensator 590, which is indicative of a motion 
compensated reference picture, is connected in signal communication with a first 
input of a multiplier 592. The output of the reference picture weighting factor assignor 
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572, which is indicative of a weighting factor, is connected in signal communication 
with a second input of the multiplier 592. The output of the multiplier 592 is 
connected in signal communication with an inverting input of the summing junction 
510. 

5 Turning now to Figure 6, an exemplary process for decoding video signal data 

for an image block is indicated generally by the reference numeral 600. The process 
includes a start block 610 that passes control to an input block 612. The input block 
612 receives the image block compressed data, and passes control to an input block 
614. The input block 614 receives at least one reference picture index with the data 

10 for the image block, each reference picture index corresponding to a particular 

reference picture. The input block 614 passes control to a function block 616, which 
determines a weighting factor corresponding to each of the received reference picture 
indices, and passes control to an optional function block 617. The optional function 
block 617 determines an offset corresponding to each of the received reference 

15 picture indices, and passes control to a function block 618. The function block 618 
retrieves a reference picture corresponding to each of the received reference picture 
indices, and passes control to a function block 620. The function block 620, in turn, 
motion compensates the retrieved reference picture, and passes control to a function 
block 622. The function block 622 multiplies the motion compensated reference 

20 picture by the corresponding weighting factor, and passes control to an optional 
function block 623. The optional function block 623 adds the motion compensated 
reference picture to the corresponding offset, and passes control to a function block 
624. The function block 624, in turn, forms a weighted motion compensated 
reference picture, and passes control to an end block 626. 

25 Turning now to Figure 7, an exemplary process for encoding video signal data 

for an image block is indicated generally by the reference numeral 700. The process 
includes a start block 710 that passes control to an input block 712. The input block 
712 receives substantially uncompressed image block data, and passes control to a 
function block 714. The function block 714 assigns a weighting factor for the image 

30 block corresponding to a particular reference picture having a corresponding index. 
The function block 714 passes control to an optional function block 715. The optional 
function block 715 assigns an offset for the image block corresponding to a particular 
reference picture having a corresponding index. The optional function block 715 
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passes control to a function block 716, which computes motion vectors corresponding 
to the difference between the image block and the particular reference picture, and 
passes control to a function block 718. The function block 718 motion compensates 
the particular reference picture in correspondence with the motion vectors, and 

5 passes control to a function block 720. The function block 720, in turn, multiplies the 
motion compensated reference picture by the assigned weighting factor to form a 
weighted motion compensated reference picture, and passes control to an optional 
function block 721 . The optional function block 721 , in turn, adds the motion 
compensated reference picture to the assigned offset to form a weighted motion 

10 compensated reference picture, and passes control to a function block 722. The 
function block 722 subtracts the weighted motion compensated reference picture 
from the substantially uncompressed image block, and passes control to a function 
block 724. The function block 724, in turn, encodes a signal with the difference 
between the substantially uncompressed image block and the weighted motion 

15 compensated reference picture along with the corresponding index of the particular 
re f erence picture, and passes control to an end block 726. 

In the present exemplary embodiment, for each coded picture or slice, a 
weighting factor is associated with each allowable reference picture that blocks of the 
current picture can be encoded with respect to. When each individual block in the 

20 current picture is encoded or decoded, the weighting factor(s) and offset(s) that 

correspond to its reference picture indices are applied to the reference prediction to 
form a weight predictor. All blocks in the slice that are coded with respect to the 
same reference picture apply the same weighting factor to the reference picture 
prediction. 

25 Whether or not to use adaptive weighting when coding a picture can be 

indicated in the picture parameter set or sequence parameter set, or in the slice or 
picture header. For each slice or picture that uses adaptive weighting, a weighting 
factor may be transmitted for each of the allowable reference pictures that may be 
used for encoding this slice or picture. The number of allowable reference pictures is 

30 transmitted in the slice header. For example, if three reference pictures can be used 
to encode the current slice, up to three weighting factors are transmitted, and they 
are associated with the reference picture with the same index. 
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If no weighting factors are transmitted, default weights are used. In one 
embodiment of the current invention, default weights of (Vz, Vz) are used when no 
weighting factors are transmitted. The weighting factors may be transmitted using 
either fixed or variable length codes. 

5 Unlike typical systems, each weighting factor that is transmitted with each 

slice, block or picture corresponds to a particular reference picture index. Previously, 
any set of weighting factors transmitted with each slice or picture were not associated 
with any particular reference pictures. Instead, an adaptive bi-prediction weighting 
index was transmitted for each motion block or 8x8 region to select which of the 

10 weighting factors from the transmitted set was to be applied for that particular motion 
block or 8x8 region. 

In the present embodiment, the weighting factor index for each motion block or 
8x8 region is not explicitly transmitted. Instead, the weighting factor that is 
associated with the transmitted reference picture index is used. This dramatically 

15 reduces the amount of overhead in the transmitted bitstream to allow adaptive 
weighting of reference pictures. 

This system and technique may be applied to either Predictive "P" pictures, 
which are encoded with a single predictor, or to Bi-predictive "B" pictures, which are 
encoded with two predictors. The decoding processes, which are present in both 

20 encoder and decoders, are described below for the P and B picture cases. 
Alternatively, this technique may also be applied to coding systems using the 
concepts similar to I, B, and P pictures. 

The same weighting factors can be used for single directional prediction in B 
pictures and for bi-directional prediction in B pictures. When a single predictor is used 

25 for a macroblock, in P pictures or for single directional prediction in B pictures, a 

single reference picture index is transmitted for the block. After the decoding process 
step of motion compensation produces a predictor, the weighting factor is applied to 
predictor. The weighted predictor is then added to the coded residual, and clipping is 
performed on the sum, to form the decoded picture. For use for blocks in P pictures 

30 or for blocks in B pictures that use only list 0 prediction, the weighted predictor is 
formed as: 

Pred = W0 * PredO + DO (1) 
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where WO is the weighting factor associated with the list 0 reference picture, 
DO is the offset associated with the list 0 reference picture, and PredO is the motion- 
compensated prediction block from the list 0 reference picture. 
5 For use for blocks in B pictures which use only list 0 prediction, the weighted 

predictor is formed as: 

Pred = W1 * Predl + D1 (2) 

10 where W1 is the weighting factor associated with the list 1 reference picture, 

DO is the offset associated with the list 1 reference picture, and Predl is the motion- 
compensated prediction block from the list 1 reference picture. 

The weighted predictors may be clipped to guarantee that the resulting values 
will be within the allowable range of pixel values, typically 0 to 255. The precision of 
15 the multiplication in the weighting formulas may be limited to any pre-determined 
number of bits of resolution. 

In the bi-predictive case, reference picture indexes are transmitted for each of 
the two predictors. Motion compensation is performed to form the two predictors. 
Each predictor uses the weighting factor associated with its reference picture index to 
20 form two weighted predictors. The two weighted predictors are then averaged 

together to form an averaged predictor, which is then added to the coded residual.. 

For use for blocks in B pictures that use list 0 and list 1 predictions, the 
weighted predictor is formed as: 

25 Pred = (P0 * PredO + DO + P1 * Predl + D1 )/2 (3) 

Clipping may be applied to the weighted predictor or any of the intermediate 
values in the calculation of the weighted predictor to guarantee that the resulting 
values will be within the allowable range of pixel values, typically 0 to 255. 
30 Thus, a weighting factor is applied to the reference picture prediction of a 

video compression encoder and decoder that uses multiple reference pictures. The 
weighting factor adapts for individual motion blocks within a picture, based on the 
reference picture index that is used for that motion block. Because the reference 
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picture index is already transmitted in the compressed video bitstream, the additional 
overhead to adapt the weighting factor on a motion block basis is dramatically 
reduced. All motion blocks that are coded with respect to the same reference picture 
apply the same weighting factor to the reference picture prediction. 

5 These and other features and advantages of the present invention may be 

readily ascertained by one of ordinary skill in the pertinent art based on the teachings 
herein. It is to be understood that the teachings of the present invention may be 
implemented in various forms of hardware, software, firmware, special purpose 
processors, or combinations thereof. 

10 Most preferably, the teachings of the present invention are implemented as a 

combination of hardware and software. Moreover, the software is preferably 
implemented as an application program tangibly embodied on a program storage 
unit. The application program may be uploaded to, and executed by, a machine 
comprising any suitable architecture. Preferably, the machine is implemented on a 

15 computer platform having hardware such as one or more central processing units 
("CPU"), a random access memory ("RAM"), and input/output ("I/O") interfaces. The 
computer platform may also include an operating system and microinstruction code. 
The various processes and functions described herein may be either part of the 
microinstruction code or part of the application program, or any combination thereof, 

20 which may be executed by a CPU. In addition, various other peripheral units may be 
connected to the computer platform such as an additional data storage unit and a 
printing unit. 

It is to be further understood that, because some of the constituent system 
components and methods depicted in the accompanying drawings are preferably 

25 implemented in software, the actual connections between the system components or 
the process function blocks may differ depending upon the manner in which the 
present invention is programmed. Given the teachings herein, one of ordinary skill in 
the pertinent art will be able to contemplate these and similar implementations or 
configurations of the present invention. 

30 Although the illustrative embodiments have been described herein with 

reference to the accompanying drawings, it is to be understood that the present 
invention is not limited to those precise embodiments, and that various changes and 
modifications may be effected therein by one of ordinary skill in the pertinent art 
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without departing from the scope or spirit of the present invention. All such changes 
and modifications are intended to be included within the scope of the present 
invention as set forth in the appended claims. 
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CLAIMS 

1 . A video encoder (500) for encoding video signal data for an image block 
and a particular reference picture index, the encoder comprising a reference picture 

5 weighting factor assignor (572) for assigning a weighting factor corresponding to the 
particular reference picture index. 

2. A video encoder (500) as defined in Claim 1 , further comprising a 
reference picture store (570) in signal communication with the reference picture 

10 weighting factor assignor (572) for providing a reference picture corresponding to the 
particular reference picture index. 

3. A video encoder (500) as defined in Claim 1 , further comprising a 
variable length coder (540) in signal communication with the reference picture 

1 5 weighting factor assignor (572) for providing the particular reference picture weighting 
factor to the variable length coder. 

4. A video encoder (500) as defined in Claim 1 , further comprising a 
motion compensation unit (590) in signal communication with the reference picture 

20 weighting factor assignor (572) for providing motion compensated reference pictures 
responsive to the reference picture weighting factor assignor. 

5. A video encoder (500) as defined in Claim 4, further comprising a 
multiplier (592) in signal communication with the motion compensation unit (590) and 

25 the reference picture weighting factor assignor (572) for applying a weighting factor to 
a motion compensated reference picture. 

6. A video encoder (500) as defined in Claim 5 usable with bi-predictive 
picture predictors, the encoder further comprising prediction means for forming first 

30 and second predictors from two different reference pictures. 

7. A video encoder (500) as defined in Claim 6 wherein the two different 
reference pictures are both from the same direction relative to the image block. 
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8. A method (700) for encoding video signal data for an image block, the 
method comprising: 

receiving (712) a substantially uncompressed image block; 
assigning (714) a weighting factor for the image block corresponding to a 
5 particular reference picture having a corresponding index; 

computing (716) motion vectors corresponding to the difference between the 
image block and the particular reference picture; 

motion compensating (718) the particular reference picture in correspondence 
with the motion vectors; 
10 modifying (721) the motion compensated reference picture by the assigned 

weighting factor to form a weighted motion compensated reference picture; 

comparing (722) the weighted motion compensated reference picture to the 
substantially uncompressed image block; and 

encoding (724) a signal indicative of the difference between the substantially 
15 uncompressed image block and the weighted motion compensated reference picture 
along with the corresponding index of the particular reference picture. 

9. A method as defined in Claim 8 wherein computing motion vectors 
comprises: 

20 testing within a search region for every displacement within a pre-determined 

range of offsets relative to the image block; 

calculating at least one of the sum of the absolute difference and the mean 
squared error of each pixel in the image block with a motion compensated reference 
picture; and 

25 selecting the offset with the lowest sum of the absolute difference and mean 

squared error as the motion vector. 

10. A method as defined in Claim 8 wherein bi-predictive picture predictors 
are used, the method further comprising: 

30 assigning a second weighting factor for the image block corresponding to a 

second particular reference picture having a second corresponding index; 

computing motion vectors corresponding to the difference between the image 
block and the second particular reference picture; 
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motion compensating the second particular reference picture in 
correspondence with the motion vectors; 

multiplying the motion compensated second reference picture by the assigned 
second weighting factor to form a weighted motion compensated second reference 
5 picture; 

subtracting the weighted motion compensated second reference picture from 
the substantially uncompressed image block; and 

encoding a signal indicative of the difference between the substantially 
uncompressed image block and the weighted motion compensated second reference 
10 picture along with the corresponding index of the second particular reference picture. 

11. A method as defined in Claim 10 wherein the two different reference 
pictures are both from the same direction relative to the image block. 

l5 12. A method as defined in Claim 1 0 wherein computing motion vectors 

comprises: 

testing within a search region for every displacement within a pre-determ.ned 
range of offsets relative to the image block; 

calculating at least one of the sum of the absolute difference and the mean 
20 squared-error of each pixel in the image block with a first motion compensated 
reference picture corresponding to the first predictor; 

selecting an offset with the lowest sum of the absolute difference and mean 
squared error as the motion vector for the first predictor; 

calculating at least one of the sum of the absolute difference and the mean 
25 squared error of each pixel in the image block with a second motion compensated 
reference picture corresponding to the second predictor; and 

selecting an offset with the lowest sum of the absolute difference and mean 
squared error as the motion vector for the second predictor. 



30 
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